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The observed number counts of high-redshift galaxy candidates^"* have been used to 
build up a statistical description of star-forming activity at redshift z ^ 7, when galaxies 
reionized theUniverse^'^'^'^°. Standard models^^ predict that a high incidence of gravi- 
tational lensing will probably distort measurements of flux and number of these earliest 
galaxies. The raw probability of this happening has been estimated to be ~ 0.5 per cent 
(refs 11, 12), but can be larger owing to observational biases. Here we report that grav- 
itational lensing is likely to dominate the observed properties of galaxies with redshifts 
of z ^ 12, when the instrumental limiting magnitude is expected to be brighter than the 
characteristic magnitude of the galaxy sample. The number counts could be modified 
by an order of magnitude, with most galaxies being part of multiply imaged systems, 
located less than 1 arcsec from brighter foreground galaxies at z ^ 2. This lens-induced 



association of high-redshift and foreground galsixies has perhaps already been observed 
among a sample of galaxy candidates identified at z 10.6. Future surveys will need to 
be designed to account for a significant gravitational lensing bias in high-redshift galaxy 
samples. 

Along random lines-of-sight, the raw probability (or optical depth) for multiple imaging of objects 
at high redshifts — owing to gravitational lensing by individual foreground field galaxies-'^^'-'^^ — is 
c± 0.5%. However, all galaxy populations are observed to have a characteristic luminosity (L^), 
brighter than which galaxy numbers drop exponentially and below which numbers rise with a very 
steep power-law slope^'^'^. The potential for gravitational lensing to modify the observed statistics 
therefore increases dramatically, owing to the magnification of numerous, intrinsically faint galaxies 
to observed fluxes that are above the survey limit. This effect, which is known as magnification bias^^, 
leads to an excess of gravitationally lensed galaxies among flux-limited samples. Magnification bias 
is expected to be particularly signiflcant at high redshifts {z ^8), where current observations may 
only be probing the exponential tail of the LF^, so that the number density could be rising very 
rapidly towards the detection limit. Indeed, multiply imaged candidates at z ^ 7 have already been 
discovered behind foreground clusters via targeted searches^'^"^^, demonstrating this to be an efficient 
method for finding faint high redshift galaxies^^'^^. 

We assess magnification bias among high redshift galaxies assuming singular, spherical, isothermal 
gravitational lenses, which produce one or two images, and designate the apparent magnitude of the 
more magnified image (the only image in the absence of lensing) as itiab,!- We then calculate, as 
a function of the assumed characteristic luminosity (expressed in terms of absolute magnitude M^), 
the fraction of galaxies brighter than the magnitude limit (miim) for the Hubble Ultra Deep Field 
(HUDF) that would be multiply imaged (designated -Fiens)- Such curves are shown at 2; = 6, 7, 8.6 
and 10.6 in panel a of Figure 1. The superimposed solid and open points correspond to lens fractions 
for different estimates^'^ of Mi, at these redshifts. At 2; 2± 6— 7, we expect only ~ 1 percent of galaxies 
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to be lensed. At ^; 2± 8 — 10, however, we expect a lensed fraction of a few to a few tens of percent, 
depending on the true value of M^. Note that since current survey hmits are significantly fainter 
than at z 2± 6 — 7, the lens fraction is quite insensitive to M*. However, at higher redshifts where 
the survey limits might be much closer to M*, the lensing fraction is very sensitive to its uncertain 
value. 

Predictions for a significant lens fraction at z ^ 8 stand in apparent contrast to the fact that 
no image pairs have been identified in the HUDF. However, we find the probability that a multiply 
imaged galaxy, with observed mAB,i has a corresponding second image with mAB,2 < "T-iim (i-e. 
detectable with the HUDF data) to be only 2± 10%, even for galaxies that are one magnitude 
brighter than mum (see Supplementary Information). Thus, as shown in panel b of Figure 1, the 
fraction of galaxies (-Fmuit) that are detected as multiply imaged systems in the HUDF is an order 
of magnitude lower than the true lensed fraction. Although this fraction would increase somewhat 
if elliptical lenses were included in our analysis, multiply imaged systems are not expected to be 
observed in the current data. On the other hand, magnification bias also leads to a concentration 
of high redshift sources — both singly and multiply imaged — around foreground galaxies^^~^°. 
The resulting correlation between high redshift candidates and bright foreground galaxies therefore 
offers an alternative avenue to observing the effect of gravitational lensing. A schematic diagram 
illustrating this point, as well as magnification bias, is included as Supplementary Figure 1. 

To quantify this correlation, we first determine the distribution of separations between random 
lines-of-sight and the nearest bright (H < 25 mag) foreground galaxy in the HUDF, measured as the 
angular distance to the centroid. This is shown by the dotted black line in panel c of Figure 1. This 
distribution can be compared to the predictions of our model (dashed line in panel c of Figure 1). If 
the candidate sample consists of both multiply imaged and unmagnified galaxies, then the observed 
distribution of separations should be a weighted sum of the random and the lensed line-of-sight 
distributions. The correct weighting is the probability for gravitational lensing, -Fiens- Two examples 
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are shown in panel c of Figure 1. The fraction of galaxies found within 2± 1 — 2 arcseconds of a 
foreground galaxy is very sensitive to the characteristic luminosity if ^ — 19 mag, providing a 
potential observable for the influence of lensing on the number counts of z ^ 8 candidates. 

For comparison with the lensing predictions, we have measured the distribution of separations 
between a sample of z ~ 10.6 candidates^ and their nearest bright {H < 25 mag) foreground galaxy. 
Comparing the distributions, we find that these candidates are observed to be closer to bright 
foreground galaxies than are random lines-of-sight. On the other hand, the candidates are found 
at larger separations from foreground galaxies than would be predicted if they were all multiply 
imaged. Quantitatively, the Kolmogorov-Smirnov probabilities between the observed distributions 
and the all-random model or the all-lensed model (see Supplementary Information) indicate that 
both models are rejected at high significance. This suggests that a fraction of candidates may be 
gravitationally lensed. Moreover, we have generated the distribution of rcdshifts for foreground 
galaxies found within < 1.5 arcseconds of the z ~ 10.6 candidates. These distributions are 
consistent with the distribution of gravitational lens redshifts, while the redshift distribution of all 
bright foreground galaxies are not, which supports the hypothesis that foreground galaxies are lensing 
a fraction of the z ~ 10.6 candidates into the observed sample. 

With the introduction of the James Webb Space Telescope {JWST), galaxy surveys will be 
undertaken out to even higher redshifts, well into the epoch of First Light^^. Panels a and b of 
Figure 2 show Fi^ns as a function of out to z = 20. The flux limits correspond to an ultra-deep 
survey (mnm = 31.4 mag), and a medium-deep survey (miim = 29.4 mag). The evolution of the 
characteristic luminosity is unknown at these unexplored redshifts. For comparison, we therefore 
plot squares corresponding to estimates of based on an extrapolation from lower redshift HUDF 
data-^. Figure 2 shows that in ultra-deep JII^S'T surveys for First Light objects at ^; ^ 14, more than 
Fiens ~ 10% of the candidates could be lensed. In much shallower JWST surveys that only sample 
the exponential tail of the Schechter LF, a lensed object fraction of Fi 

ens ^ 10% could be seen at 
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redshifts as low as z ~ 8-10. However at z ^ 14, the lensed fraction in such surveys could be much 
higher, and may even represent the majority of observed galaxies. Surveys with JWST will therefore 
need to be carefully planned and analyzed to account for the influence of foreground lensing galaxies. 

As in the case of the HUDF the fraction of galaxies that will be detected as multiply imaged 
systems by JWST is significantly lower than the true multiple image fraction. However, as the 
multiple image fraction becomes very large at high redshifts, observed doubles could become common; 
larger than -Fmuit ~ 10% at redshifts z ^ 12 in a medium-deep (mAB < 29.4 mag) JWST survey, and 
z <^ 16 in an ultra-deep (mAB < 31.4 mag) survey. Panels e and f present the predicted distributions 
of separation for galaxies discovered by JWST from bright foreground galaxies. If the observed 
evolution in continues to higher redshift, then the spatial distribution of high redshift galaxies 
relative to foreground galaxies will depart from random at redshifts z 14 for ultra-deep surveys, 
and at z ^ 10 for medium-deep surveys with JWST. A crucial prediction is that the majority of very 
high redshift galaxies discovered with JWST may be located less than 1 arc-second from a bright 
foreground galaxy, and will have been gravitationally magnified into the sample. 

A key goal for JWST will be to measure the number counts of high redshift candidates, and 
to construct luminosity functions (LF) in order to build up a statistical description of star-forming 
activity in galaxies. LFs describing the density of sources per unit luminosity are parametrised by 
a Schechter function^^, "^{L) oc (L/L^)" exp (— L/L^)l/L^, including free parameters for the power- 
law slope at low luminosities (a), and the characteristic absolute AB-magnitude [Mab — = 
—2.5 logj^o(-^/-^*)] brighter than which galaxy numbers drop exponentially. Importantly, gravitational 
lensing has the potential to significantly modify the observed LF from its intrinsic shape^^. In 
particular, at very high luminosities in the exponential tail of the Schechter function, the LF shape 
can be modified from exponential to power-law, since gravitational lensing magnifies numerous faint 
sources to apparently higher luminosities. Figure 3 shows that the shapes of LFs near the flux limit 
are not affected by gravitational lensing at z 6 — 8. However, if the evolution of the galaxy LF 

5 



continues into the reionisation era (we assume an extrapolation of the fitting formulae based on 
candidates discovered in and around the HUDF^), then we find that JWSTwill measure LFs that 
are significantly modified by lensing at redshifts above z ~ 14 and z ~ 10 in its ultra-deep and 
medium-deep surveys, respectively. 

Our results imply that while published LF's at 2; ^ 7 are not currently corrected for a potential 
gravitational lensing bias, such corrections will need to be prescribed in detail for future surveys 
that aim to measure the build-up of stellar mass among the first galaxies using JWST. In particular, 
studies of the high redshift LF will require good understanding of the magnification bias for high 
redshift galaxies, in order to correct for gravitational lensing and uncover its true unlensed shape at 
z ^ 12. Of particular importance will be the unknown evolution of M*, which could be influenced (for 
example) by supernovae feedback from population-Ill stars^^, in addition to hierarchical clustering 
and formation. Gravitational lensing could magnify z ^ 10-12 objects to flux levels that will allow 
spectroscopic observations using JWST and the largest ground-based near-IR spectrographs. A 
further implication of our analysis is that gravitational lensing could be used to probe the shape 
of the high redshift LF at luminosities that are not otherwise accessible^^, using the association of 
high redshift galaxy candidates and foreground galaxies, combined with careful modelling of the 
gravitational lensing bias. 
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Figure 1. Gravitational lens fractions among candidate high redshift HUDF galaxies. Panel a: The 

fraction of multiply-imaged high redshift galaxies. Panel b: The fraction of high redshift galaxies in which 
multiple images could be detected in the HUDF. Panel c: The probability distribution of image separations 
(at z ~ 10.6) relative to the nearest bright foreground galaxy, in the cases of random lines-of-sight (black 
dotted line), of gravitational lenses (black dashed line), and for composite distributions computed for two 
(faint) values of M^,. Also shown is the distribution of measured separations for twenty z ~ 10.6 candidates^ 
in the HUDF (stepped blue histogram). Lyman-break galaxy candidates have been selected with median 
redshifts of z ~ 6, z ~ 7, z ~ 8.6 and z ~ 10.6. At z ~ 6, candidate selection using the Advanced Camera 
for Surveys reaches^ = 30 mag (absolute magnitude Myn^ = —16.7 mag). At higher redshifts, objects 

in the WFC3 HUDF data can be selected'* to — 29.0 mag, corresponding to = —18.0, —18.3 and 

— 18.6 mag at z ~ 7, 8.6 and 10.6. The open squares correspond to lens fractions given the fitting formula^'^*^ 
~ —21 + 0.32 X (z — 3.8). The solid squares represent alternative estimates'*'^ of M^,. The model for 
gravitational lensing^^ is based on the velocity dispersion function of galaxies^''. Galaxy mass distributions 
are modelled as Singular Isothermal Spheres, and we assume a constant co-moving density of lenses. Elliptical 
lenses would not significantly alter the cross-section^*, but would provide additional images, and so increase 
the fraction of observed galaxies that are lensed. We assume a Schechter LF^^, with power-law slope* a = —2. 
A change of 0.3 in a leads to a 40% change in the lens probability. We have used the cosmology based on 
7-year results from the WMAP satellite^^ throughout this Letter. 
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Figure 2. Probabilities for multiple imaging of high redshift galaxies to be observed with JWST. 

The panels mirror those of Figure 1, but with examples of limiting magnitudes and redshifts appropriate for 
both an ultra-deep survey (toab < 31.4 mag, ^ InJy), and a medium-deep survey (toab < 29.4 mag) with 
JWST. The corresponding limiting absolute magnitudes are listed. Panels a-b: The fraction of observed 
galaxies that have multiple images. The superimposed solid and open points correspond to lens fractions given 
a faint value* of M^, = —17.8 at 2; ~ 8.6 and z ~ 10.6, and a fitting formula M^,{z) based on lower redshift 
data, respectively^'^^. The latter is extrapolated to high redshift where data does not yet exist. Panels 
c-d: The fraction of high redshift galaxies in which multiple images could be detected by JWST. Panels 
e-f: The probability distribution of image separations relative to the nearest bright foreground galaxy, in the 
cases of random lines-of-sight (black dotted line), and for composite distributions computed for values of M^, 
extrapolated from observations in the HUDF using the previously mentioned fitting formula^ We note that 
imaging surveys with JWST will be working at the diffraction limit 0.08 arcseconds resolution FWHM) at 
~ 2 /Ltm. This resolution is higher than is currently available in the HUDF near-IR images, where candidates 
have been selected in close proximity to bright foreground galaxies, and hence high redshift candidates will 
also be detectable close to foreground galaxies. 
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^AB (f^ag) 

Figure 3. Gravitational lens induced modification of the bright end of the high redshift galaxy 
luminosity function to be observed with JWST. Thin curves present the intrinsic LF {'^), and solid 
curves the observed LF following modification from gravitational lensing. For simplicity, a uniform magnifi- 
cation was assumed outside regions of sky that are multiply-imaged, with a value such that flux is conserved 
over the whole sky. The parameters describing the LF are extrapolated to high redshift, where data does not 
yet exist, assuming fitting formulae based on data from the HUDF^'"^^ . Of particular relevance are the values 
of Mi,, which are listed. The solid and open points show the luminosities and densities of the faintest galaxies 
to be observed with JWST, assuming limiting magnitudes appropriate for both an ultra-deep JWST survey 
(toab < 31.4 mag), and a medium-deep JWST suivey (mAB < 29.4 mag). The probability for gravitational 
lensing will become of order unity in the steep exponential parts of the LF at sufhciently high redshifts. This 
gravitational forest should not to be confused with the purely mathematical effects of image crowding that 
makes the detection and de-blending of faint objects harder at progressively fainter fluxes'^". These latter 
effects are referred to as either the instrumental confusion limit — when the instrumental resolution is not 
good enough to statistically distinguish all faint background objects from brighter foreground objects — or the 
natural confusion limit — when the instrumental resolution is good enough to distinguish faint background 
objects from brighter foreground objects, but the images are so deep that objects start overlapping because of 
their own intrinsic sizes. The HUDF and JWST images are in the latter regime^°, and as argued in this Let- 
ter, likely have the additional fundamental limitation that gravitational lensing will magnify a non-negligible 
fraction of faint objects into the sample. 
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SUPPLEMENTARY INFORMATION 

The LeMer to Nature estimates the probabihty for gravitational lensing among high rcdshift galaxies, 
with emphasis on current surveys using the HUDF, and future surveys to be carried out using 
JWST. The Letter also uses the observed distributions of separation between very high redshift 
galaxy candidates in the HUDF and foreground galaxies, to show that a significant fraction of these 
objects are likely to be gravitationally lensed. The following sections expand the brief descriptions 
of the modelling and interpretation that can be found in the Letter to Nature. 

1 Schematic picture of magnification bias and foreground galaxy 
correlation 

In Supplementary Figure 1, we present a schematic representation of a portion of the HUDF, which 
shows how magnification bias leads to a correlation between foreground galaxies and high redshift 
candidates. Panel a shows a representation of the Schechter function^^, which describes the luminos- 
ity function (LF) of high redshift galaxies. The limiting absolute magnitude and characteristic 
magnitude are shown for reference. Gravitational lensing magnifies sources relative to their 
intrinsic luminosity, and draws intrinsically faint galaxies into the flux limited sample. Since faint 
galaxies are much more common than bright galaxies, the number of sources per unit area in regions 
of lensing magnification is significantly higher. This leads to a bias of sources near foreground galax- 
ies. To illustrate this effect on the high redshift galaxy samples in the HUDF, we sketch in panel b a 
portion of the sky approximately 10 arcseconds across. In this panel, background sources (i.e. high 
redshift galaxies) arc shown in red and foreground galaxies (those near z ~ 1 — 2) in blue. The faint 
galaxies (with Mab > Afum) are signified by open symbols, while the closed symbols signify bright 
galaxies with Mab < -^Um- The black dotted disks denote regions of sky where background sources 
will be multiply-imaged by a foreground galaxy. For illustration, this schematic representation over- 
estimates the total lensing cross-section, which is ~ 0.5%, by a factor of ~ 10. The typical angular 
scale of these regions is 1 arc-second. We show those faint galaxies that lie within these lensing 
regions in green. In panel c, the faint galaxies that are close enough to bright foreground galaxies 
to be multiply-imaged (shown in green), producing in general a bright image with Mab < M\[^, 
and an undetected faint image with Mab > -^lim- Finally, the observed association of high redshift 
galaxies with bright foreground galaxies — once gravitational lensing bias has been accounted for — 
is shown in panel d. In this example 2 of the 5 observed high redshift galaxies {Mab < -A^iim) have 
entered the sample owing to gravitational magnification, and have close alignment with foreground 
galaxies as a result. In this case we find 40% of high redshift galaxies within ~ 1 arc-second of bright 
foreground galaxies, even though the observed density of bright foreground objects is 1 per 20 square 
arcseconds. We note that gravitational lensing can also lower the observed density of sources on the 
sky that have neighbouring foreground galaxies by magnifying the angular extent of the image plane 
relative to the source plane. This effect, which is usually referred to as depletion, is not dominant 
when the LF is steep, as is the case for high redshift galaxies. 

2 Lens model 

We refer to the a-priori probability for a galaxy at redshift Zgai to be multiply-imaged by an inter- 
vening foreground galaxy as the multiple image optical depth^^ 
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dz 



dz, (1) 



where 

^ = / daHa,z){l + zf'-^.Dl9Ua,z), (2) 

6er is the Einstein radius as a function of velocity dispersion a and redshift z, is the angular 
diameter distance to the lens, and t is time. To calculate Tm, we use the expression for the angular 
Einstein radius for a Singular Isothermal Sphere (SIS) 

where Dg and are the angular diameter distances to the source, and between the lens and source, 
respectively. 

To evaluate <I>(cr, z), we first assume^^ the Sloan Digital Sky Survey (SDSS) velocity dispersion 
function^'' <I*sdss(o') 

. N , , / c \ exp [— (cj/(T*)''l ^dcr 
*SDSS(<.M. = *.(-) ^4,^^/3-, (4) 

where $^ = 2 x lO^^Mpc-^, a = 2.32, /3 = 2.67 and a^, = 161km/s. We further assume that 
the lens population has a constant co-moving density ^{a,z) = ^*sDSs(o")- Although the density of 
galaxies must decline at high redshift, this approximation is reasonable, since most lensing occurs 
at z ^ 1.5. The uncertainty in predictions of the lens fraction owing to the unknown evolution of 
the velocity dispersion function is approximately a factor of two^^'^^ . We note that this prescription 
gives a lensing cross-section for z ~ 2 quasars that is consistent with the SDSS analysis^^, which is 
an observational requirement. The lens model assumes that galaxy velocity dispersions reach down 
to as low as cj = lOkm/s. However, as the lensing neighbours are selected by velocity dispersion, the 
distribution of lensed separations is not sensitive to the assumed cutoff, because the lens cross-section 
is proportional to velocity dispersion to the fourth power (o"^). 

We have utilised a simple lens model. In particular we have not included non-spherical lens 
distributions, which produce four rather than two image lenses in some cases. Indeed, empirical 
estimates for the fraction of quasar lenses that have four images of about 40% have been obtained 



from the homogeneous CLASS sample (http://www.aoc.nrao.edu/'~^siiiyers/class.htmip, and of 

about 15% from the Sloan Digital Sky Survey"^"^ . While the predicted four-image to two-image ratio 
depends on the ellipticity of the lensing galaxies'^^ , the ellipticity is found not to significantly influence 
the overall cross-section for multiple imaging^®'^^'^"^. On the other hand, the magnification bias can be 
larger for an elliptical lens, which would increase the expected multiple imaging rate^^. Moreover, the 
additional images in a four image lens would increase the fraction of observed candidates that are part 
of a multiply-imaged galaxy. Using spherical lenses for our estimates is therefore conservative with 
respect the expected influence of gravitational lensing on samples of high redshift galaxy candidates, 
both in terms of the number of lenses predicted and the association between high redshift candidates 
and bright foreground galaxies. We note here that the lens population for z ~ 8 — 10 candidates is 
at higher redshift than the lens galaxies responsible for the aforementioned samples. However the 
measured ellipticity distribution is nearly constant over a very wide range of flux and redshift^^ . 
Thus, we argue that since our simple model provides a good statistical description of the available 
data, neglecting elliptical lenses is reasonable, particularly given the range of other uncertainties. 
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2.1 magnification bias 



Flux limited samples are subject to magnification bias, which increases the relative probability that 
detected galaxies are gravitationally lensed-*^^, and concentrates sources in a flux limited sample 
around foreground objects^®. Yan et al.'^ have observed a number of z ~ 8 — 10 candidates that 
have neighbouring bright foreground galaxies. As discussed in the Letter, this correlation is likely to 
be the manifestation of these effects. The magnification bias for sources with observed luminosities 
between L and L + dL is 

while the corresponding overall magnification bias in a flux limited sample is 



-"lens — fOQ dP.-r.tTS ' 



where dP/djjL is the probability distribution for magnification (/x) within the range /imin < fJ- < /J-max.- 
Of relevance for high redshift surveys in the HUDF or with JWST (which have an angular resolution 
much better than the image separation^") is the magnification distribution for the brighter image 



dP, 



m,l 



d/X (/X-l)3 

We adopt a Schechter^^ function for the LF 



for 2 < /X < GO. (7) 



^(L)dL = *,(|-)"exp(-|-)g, (8) 

where is the characteristic density in Mpc"^, and a is the power-law slope at luminosities below 
the characteristic break at L*. Below, and in the Letter, we quote the characteristic luminosity in 
terms of the absolute magnitude = M + 2.5 log^o L/L^. 



2.2 gravitationally lensed luminosity function 

We note that in the presence of significant gravitational lensing, the LF can be modified from its 
intrinsic form^^, leading to a power-law slope at the bright-end of —3 (as shown in Figure 3 of 
the Letter). The modified LF can be estimated by modelling the overall magnification distribution 
using the probability distribution for magnification of multiply-imaged sources over a fraction Tj^^ 
of the sky, combined with a de-magnification /Udcmag = (1 — (A*muit)T'm)/(l — Tm) elsewhere. Here 
(/^muit) = 4 is the mean magnification of multiply-imaged sources, and /Xdemag tias been calculated in 
order to conserve flux on the cosmic sphere centred on an observer. The modified LF can then be 
approximated using the expression 

M/obs(^) = (1 - r^)^— ^(VWemag) + H dfi- + ^) ^{L/fi), (9) 

/^demag Jo H \ dfl djX J 

where dP-m.,2 / = 2/{fi + 1)^ for < /x < oo, is the probability distribution for the second image. 
We approximate the true magnification distribution by using a constant value of /x^emag in regions 
of no multiple imaging. This is valid for the modification of the LF at luminosities much brighter 
than M*, in which we are interested in this work. 
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3 Lensing predictions for high redshift surveys 



We summarise the predictions of our lensing model in Supplementary Figure 2. As shown in panel 
a, the lensing optical depth rises toward high redshift^^, and is 4-5 times as large for sources at z ~ 6 
as at z ~ 1.5. It doubles again from z = 6 to z = 20, so that at z > 10 the multiple imaging fraction 
is greater than 0.5%, even in the absence of magnification bias. Panel b shows the magnification 
bias as a function of the difference between and the survey limit in absolute magnitude Mum- 
At low redshifts, deep surveys can probe well below M*, so that the magnification bias is dominated 
by the power-law slope (a) of the Schechter function at low luminosities, and the resulting bias is 
of order unity. At very high redshifts, however, current surveys can only reach or even brighter, 
and hence the bias can be much higher (tens or hundreds) owing to the exponential nature of the 
LF sampled. We next combine the optical depth Tm with the bias -Biens,i to find the multiple image 
fraction Fiens = -Biens,iTm/(Acns,i''"m + (1 — Tm)), where we have assumed the bias of those galaxies 
which are not multiply-imaged to be unity. In panel c we plot contours of Fiens as a function of z 
and {Mi, — Mum). Surveys at low redshift {z ^3), with limits fainter than M*, should have multiple 
image fractions below 1%. However, at higher redshifts the lens fraction can be much higher. For 
example, a survey at z ^ 6 that reaches only 1 magnitude brighter than could have a lens fraction 
of 10%. Current and future surveys at z > 6 with HST and JWST lie in this upper-right portion 
of panel c. Only ultradeep surveys with JWST that reach well below iW^ at z ^ 10 will have their 
lensing fraction drop well below 10% again. 

4 High redshift galaxy candidate samples 

To compare the predictions of our model with samples of high redshift galaxy candidates, we inves- 
tigate samples from the HUDF compiled by Yan et al.^. These and other authors have employed 
the Lyman-break (or dropout) technique to select galaxies at z ^ 7 in the HUDF. We note that the 
major colour criteria used to select the samples of Yan et al.^ are very similar to those employed by 
other groups including Bouwens et al.^'^. However the overlap of individual candidates among the 
samples from these two teams is small. In particular, none of J-dropouts compiled by Yan et al.^ 
are among the three J-dropouts presented by Bouwens et al.-*^. There could be a range of reasons for 
this disjoint. With respect to our current work, we note that one reason for the difference in sample 
selection could be the choice of whether to include candidates near bright foreground objects. By 
construction, the samples of Yan et al.^ were not biased against regions around foreground objects, 
indicating that i/gravitationally lensed, multiply-imaged galaxies do exist in the HUDF at z 8— 10, 
then they would be selected. We therefore concentrate here on the predicted gravitational lensing 
statistics for these samples. 

The z ~ 8.6 sample used to discuss the gravitational lensing of galaxies in the HUDF as part of 
this work consists of 15 ^-dropouts (spanning the redshift range of 7.7 ^ z ^ 9.4), while the z ~ 10.6 
sample consists of 20 J-dropouts (spanning 9.4 ^ z ^ 11.8). These objects are all very faint, and 
have magnitudes ranging from Mab = 28.0 — 29.0. 

4.1 lensing predictions for 2; ~ 8 — 10 candidates 

We have calculated multiple-imaging probabilities for the z ~ 8 — 10 samples^ as a function of 
galaxy absolute magnitude assuming = —17.8 mag. These results can be used to discuss lensing 
probabilities for individual z ~ 8 — 10 dropout candidates^ in more detail. 

Panel a of Supplementary Figure 3 shows the probability that a galaxy with absolute magnitude 
Mab,i is multiply-imaged. At 2; ~ 6 — 7, only galaxies much brighter than Mab,i < —21 mag have a 
significant chance of being lensed. However, at z 2± 8 — 10 galaxies as faint as Mab,i — —19 mag have 
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a substantial lens fraction. Of course, these are just statements reflecting the relative brightness of 
Miim and M*. Our results suggest that a number of 2; ~ 8 — 10 galaxies detected in the HUDF should 
be multiply-imaged. On the other hand, we note that we have not identified any image pairs in the 
HUDF. Panel b shows the probability that a lensed galaxy with observed Mab,i has a corresponding 
second image with Mab,2 < -^lim; such that it is also detectable above the HUDF flux limit. For 
this probability to be large ( ^ 50%), the detected image must be more than ~ 1 mag brighter than 
Miim. Panel c shows the fraction of galaxies that are part of a lensed pair in which both images are 
detectable, [F^u = -Piens x P{Mab,2 < M\[^\Mab,i)]- We find that at z 2± 6 — 7, only galaxies that 
are several magnitudes brighter than have a reasonable chance (few-10%) of being observed as 
a multiple image system. However, at 2; ~ 8 — 10, this probability increases to ^ 10% for galaxies 
that are only a magnitude brighter than M^^. 

We note that the predicted fraction would increase if we modelled elliptical lenses which can have 
more than two images. We roughly estimate the fraction in this case by noting that a four-image 
lens typically has either two bright images of approximately equal magnification where the source is 
near a fold caustic, or three bright images with the central one having a magnification equal to the 
sum of the other two^^ . Thus, close to the detection limit, we expect either the two bright images, 
or only the brightest of three bright images would be detected for typical four-image lenses. We 
therefore argue that for the (empirically observed) 15-40% of cases where the lens has four images, 
the fraction of multiply-imaged systems in which more than one image is detected will increase by 
at most a factor of approximately two. 

In Supplementary Figure 3, we have superimposed squares to show probabilities for individual 
galaxy candidates in the HUDF'^ . We use = —17.8 mag estimated by Yan et al.^ as an example. 
By summing probabilities for individual galaxy candidates in the Yan et al.'^ sample, we calculate 
the (mean) expected number of lensed systems, finding {Ni^^s) = 0-8 ±0.1 and (Niens) = 1.7 it 0.2 
among the 15 and 20 candidates at z 2± 8.6 and z c± 10.6, respectively. If the true value is fainter, 
these numbers will be higher. A Poisson distribution with mean (Aliens) = 2.5 implies that at least 
one lens pair would be found among the observed 2; ~ 8 — 10 sample in 92% of cases, which stands 
in apparent contrast to the fact that no image pairs have been identified in the HUDF. However, we 
find the probability that a lensed galaxy with observed mAB,i has a corresponding second image with 
fnAB,2 < "iiim (i-e. detectable with the HUDF data) to be only ~ 10%, even for galaxies that are 
one magnitude brighter than M\\^. Here we neglect the caveat that secondary images could fall on 
top of the foreground galaxies, which would further reduce the chance of their being observed. We 
estimate that the number of systems that would be observed as doubles (i.e. both images detected) 
to be (iVdbi) = 0.2 ± 0.06 and (iVdbi) = 0.4 ± 0.1 at z = 8.6 and 10.6, respectively. A Poisson 
distribution with mean (Adbi) = 0-6 implies that the observed z ~ 8 — 10 sample would not contain 
any doubles in most (55%) cases. Thus, with M-^ = —17.8, we find that Aliens ~ 2 — 3 of the detected 
galaxies in each redshift range should be multiply-imaged, but do not necessarily expect any of these 
to be identified as multiple image systems. On the other hand, an even fainter value of = —17.3 
(—17.1) implies that at least one double would be observed in 90% (99%) of cases, imposing an 
upper-limit of ^ — 17 at z ~ 8 — 10. We note that the values of — as measured — could also 
be biased by gravitational lensing (see Figure 3 of the Letter). Currently, none of the published LFs 
at z ^ 7 are corrected for the potential lensing bias. However it is clear from the results presented 
in our Letter, that such corrections will need to be prescribed in detail in the future. 

The mean magnification of detected lensed images (with = —17.8) is (fi) ~ 6, indicating 
that gravitational lensing in these samples would lead to over-estimates of the luminosity density at 
z ~ 8.6 and z ~ 10.6 of ~ 50% and ~ 80%, respectively, if the magnification is neglected. Since the 
axis ratio of lensed images is equal to the magnification for an SIS, this also implies that the lensed 
images should be significantly elongated, and indeed some candidates appear to have this property^ . 
However, the signal-to-noise for the detected candidates is too low to draw quantitative conclusions. 
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5 Distribution of lensed separations 



As shown in the previous section, we find that in most cases only the more magnified image will be 
brighter than the detection threshold. We therefore calculate the expected distribution of angular 
separation between a lens galaxy and the brighter of the two images. The apparent angular separation 
of the bright image with magnification n from the center of a lensing SIS at redshift z < Zgai is 

A01ens(/^, ^) = (l + ^) ^Er(^). (10) 

Using this expression we evaluate the probability distribution for the separation of bright images of 
image pairs from the lensing galaxy 

dP /-^gal fOO roo dTyadP^l 

oc 



dAe 



/ dz di, dL^^^{L/fi)SaU[^e-Ae,Ul^,z)], (11) 

Jo J2 JLii^ dz d/J, 

where Sdii is the Dirac delta function, and is the unlensed luminosity corresponding to the survey 
flux limit. 



6 Observed correlation between high redshift candidates and fore- 
ground galaxies 

For comparison with the lensing predictions, we measure the distribution of separations between 
z 2± 8 — 10 candidates^ and their nearest bright (H < 25 mag) foreground galaxy. The red histograms 
in panels a of Supplementary Figures 4 and 5 show the cumulative distributions of this separation for 
the z ~ 8.6 and z ~ 10.6 candidates, respectively. Comparing the distributions in these two panels 
with the random line-of-sight and lensed predictions, two trends are obvious. Firstly, these z c± 8 — 10 
candidates arc observed to be closer to bright foreground galaxies than are random lincs-of-siglit. On 
the other hand, the candidates are found at larger separations from foreground galaxies than would 
be predicted if they were all multiply-imaged. Quantitatively, the Kolmogorov-Smirnov probabilities 
(Pks) between the observed distributions and the all-random model or the all-lensed model (labeled 
in the figure) indicate that either model is rejected at high significance. This suggests that a fraction 
of these candidates may be gravitationally lensed. 

For illustration, the thick black lines in panels a of Supplementary Figures 4 and 5 show the 
composite distributions corresponding to multiple image lens fractions of Ficns = 0.2 and 0.4, at 
z ~ 8.6 and 10.6, respectively. These provide an excellent fit to the data (-Pks values labeled in 
Supplementary Figures 4 and 5). Panels b of Supplementary Figures 4 and 5 show the differential 
distributions of the observed angular separations (red) , as well as the corresponding composite models 
(thick black) and the random distributions (dotted black) . The latter demonstrates that the largest 
observed separations can be attributed to a random distribution. 

We next examine the redshift distributions of the nearest bright foreground galaxies, using spec- 
trophotometric redshift estimates^^ . The cumulative distributions are compared in panels c of Sup- 
plementary Figures 4 and 5 for the z ~ 8.6 and 10.6 candidates, respectively. The red histograms 
are the distributions for the neighbours of the high redshift candidates, the dotted black lines are 
distributions for the neighbours of random lines-of-sight, and the dashed black lines are distributions 
for the expected gravitational lens redshift^^. The redshift distributions of the foreground galaxies 
associated with the full samples of 2; ~ 8 — 10 candidates cannot be differentiated from those associ- 
ated with random lines-of-sight. In addition, for the z ~ 10.6 case in particular, foreground galaxy 
rcdshifts are found not to be drawn from a lensed galaxy population. However, the lens angular 
separation cuts off sharply at A6' ~ 1.5 arcseconds. We therefore generate the distribution of red- 
shifts only for foreground galaxies found within < 1.5 arcseconds of the 2; 2± 8 — 10 candidates. 
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which are shown as the blue histograms in these two panels. These distributions are consistent 
with the distribution of gravitational lens redshifts, which supports the hypothesis that many close 
candidate-foreground galaxy pairs in this sample result from magnification bias. In panels c and d 
of Supplementary Figures 4 and 5, we show the model redshift distributions corresponding to the 
values Ficns = 0.2 and 0.4 for candidates at z ~ 8.6 and 10.6, respectively (thick black lines). These 
again provide an excellent fit to the data, which, when taken together with the correlation between 
high redshift and foreground galaxy positions, provides compelling evidence for a significant lens 
fraction among the z ^8 galaxy candidates, since these foreground galaxies were selected only on 
the basis of their alignment with high redshift candidates. 
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Supplementary Figure 1. Schematic representation showing how magnification bias leads 
to an association between foreground galaxies and high redshift candidates. Panel a: 

The Schechter LF of high redshift galaxies. Panel b: High redshift galaxies (red) and foreground 
galaxies (blue). Faint galaxies (those with M^b > Afiim) are signified by open symbols, while the 
closed symbols signify bright galaxies with M^b < Afum- The black dotted disks denote regions of 
sky where background sources will be multiply imaged by the foreground galaxy. Faint background 
galaxies that lie within these lensing regions are shown in green. Panel c: The lensed faint galaxies 
are multiply-imaged, producing a bright image with M^b < -A^Umi and an undetected faint image with 
Mab > -^lim- Galaxies located near the lines of sight to foreground galaxies that are not multiply 
imaged, are deflected to larger separations, resulting in a lowering of observed source density (an 
eflFect known as depletion). Panel d: The correlation of observed high redshift galaxies (solid red 
symbols) with bright foreground galaxies once gravitational lensing bias has been accounted for. 
The depletion eff'ect is opposite in sign to the correlation introduced through strong lensing, but is 
sub-dominant in the case of high redshift galaxies. 
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Supplementary Figure 2. Probabilities for multiple imaging of high redshift galaxies. Panel 

a: The lensing optical depth as a function of redshift. Panel b: The magnification bias as a function 
of the difference between and the limiting survey absolute magnitude M\[^. Three values of the 
faint-end LF-slope a are considered. Panel c: Contours of -Fiens as a function of z and {M^ — M\[^), 
assuming^ a = —2. 



Fraction of lensed galaxies Fraction of 2nd images above /W|jm Frac. of galaxies with 2 detected images 




Mab,i MaB.I M/^B,^ 

Supplementary Figure 3. Probabilities for multiple imaging of z ~ 8 — 10 galaxy candidates. 

Panel a: The probability that a galaxy with observed M^ba is multiply-imaged. The expected 
mean number of lenses ((A'^iens)) among the z ~ 8.6 and z ~ 10.6 candidates is listed. Panel b: 
The probability that a lensed galaxy with observed Mab,i has a corresponding second image with 
^AB,2 < -^lim- Panel c: The fraction of galaxies that are part of a lensed pair in which both 
images are detectable (68% errors here were computed using a bootstrap method). The expected 
mean number of systems that would be observed as doubles ((A'dbi)) is listed. We have assumed the 
determinations of = —17.8 and a = —2, and observed absolute magnitudes Ma_b,i from Yan et 
al.^ (the squares correspond to probabilities for the individual galaxy candidates). 
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Supplementary Figure 4. Probability distributions for angular proximity and redshift of 
bright foreground galaxies among the sample of z ~ 8.6 candidates. Panel a: The cumula- 
tive distribution for the angular separation between z ~ 8.6 candidates and their nearest foreground 
galaxies with i7 < 25 in the HUDF (red histogram). Also shown are the model cumulative distribu- 
tions of angular separations between random lines-of-sight and the nearest bright foreground galaxies 
(dotted black line), and of angular separations for the brighter image of gravitationally lensed objects 
at z = 8.6 (dashed black line). The thick black line shows the composite cumulative distribution 
generated by summing the random and lensed histograms, with a weight equal to a lens fraction of 
F\ens = 0-2. Panel b: The binned histograms (area normalised to unity) for the angular separations 
of observed candidates (red), for separations in the composite model (thick black), and for separa- 
tions from random lines of sight (dotted black). Panel c: The cumulative redshift distribution for 
foreground galaxies associated with z ~ 8.6 candidates (red histogram). Also shown are the cumu- 
lative distributions for the redshifts of foreground galaxies nearest to random lines of sight (dotted 
black line), and for the expected gravitational lens redshifts assuming sources at z = 8.6 (dashed 
black line). The thick black line shows the composite cumulative distribution (-Ficns = 0.2). We also 
plot the redshift distribution of foreground galaxies within 1.5 arcseconds of a z ~ 8.6 candidate 
(blue histogram). Panel d: The binned histograms for the foreground galaxy redshifts along lines 
of sight to dropout candidates (red), and for the composite model (black). In each case, values of 
Pks corresponding to the comparison of the data with the model distributions are listed. 
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Supplementary Figure 5. Probability distributions for angular proximity and redshift of 
bright foreground galaxies among the sample of z ~ 10.6 candidates. The panels mirror 
those of Supplementary Figure 4. We assume -Fiens = 0.4 for the model composite distribution. 
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